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Abstract. Computer-aided individual recognition of animals based on their natural markings has become an indispensa- 
ble tool in ecology research. However, this is problematic in species with faint patterns. Here we test whether individually 
reared larvae of the European Fire Salamander (Salamandra salamandra) can be reliably recognized from images of their 
lateral tail patterns. We used Wild-ID software to (1) estimate the recognition uncertainty resulting from pre-processing 
the images, (2) quantify how pre-contrasting improves recognition, (3) assess the effect of ontogenetic pattern change on 
recognition until metamorphosis, and (4) test how recognition performs with larger image libraries. Our results show that 
discrimination of larvae is highly successful. Pre-processing did not lead to a relevant change in the recognition probability, 
while pre-contrasting even reduced the recognition probability. The shorter the time interval between two photos, the more 
readily an individual will be recognized. The overall recognition rate was 99.81%, with false rejection rates (FRR, calculated 
as the number of falsely rejected images divided by the number of matching attempts) amounting to 4.66, 0.77 and 0.20% 
for FRR, (first image provided by Wild-ID does not match), FRR, (none of the first ten images provided by Wild-ID match- 
es) and FRR, (none of the first 20 images provided by Wild-ID matches), respectively. These rates are among the lowest ever 
reported. The inclusion of images of 130 wild-caught larvae did not negatively affect successful individual recognition. Au- 
tomated photo-identification may therefore be considered a reliable tool for fieldwork on European Fire Salamander larvae. 


Key words. Amphibia, Caudata, false rejection rate, ontogenetic pattern change, photographic capture-recapture, tail pat- 
tern, Wild-ID. 


Zusammenfassung: Die computergestützte individuelle Erkennung von Tieren anhand ihrer natürlichen Muster ist zu einem 
unverzichtbaren Instrument in der Ökologie geworden. Bei Arten mit nur schwachen Mustern ist dies jedoch problematisch. 
Wir untersuchen in dieser Studie, ob individuell aufgezogene Larven des Europäischen Feuersalamanders (Salamandra sala- 
mandra) anhand von Bildern ihrer seitlichen Schwanzmuster zuverlässig erkannt werden können. Wir verwendeten Wild- 
ID, um (1) die aus der Vorbearbeitung der Bilder resultierende Erkennungsunsicherheit abzuschätzen, (2) zu quantifizieren, 
wie unterschiedliche Vorkontrastierungen die Erkennung verbessern, (3) die Auswirkung der ontogenetischen Musterver- 
änderung auf die Erkennung bis hin zur Metamorphose zu beurteilen und (4) zu testen, wie die Erkennung bei größeren 
Bildbibliotheken funktioniert. Unsere Ergebnisse zeigen, dass die Unterscheidung von Larven sehr erfolgreich ist. Die Vor- 
bearbeitung führte zu keiner relevanten Veränderung der Erkennungswahrscheinlichkeit, während die Vorkontrastierung 
die Erkennungswahrscheinlichkeit sogar reduzierte. Je kürzer das Zeitintervall zwischen zwei Fotos ist, desto leichter ist die 
Erkennung. Die Gesamterkennungsrate lag bei 99,81 %, wobei die Raten falschlich zurückgewiesener Bilder (FRR, berechnet 
als Anzahl der falsch zurückgewiesenen Bilder geteilt durch die Anzahl der vorgenommenen Tests) zwischen 4,66 %, 0,77 % 
und 0,20 % für FRRi (erstes von Wild-ID geliefertes Bild stimmt nicht überein), FRRıo (keines der ersten zehn von Wild-ID 
gelieferten Bilder stimmt überein) bzw. FRR21 (keines der ersten 20 von Wild-ID gelieferten Bilder stimmt überein) lagen. 
Diese gehören zu den niedrigsten jemals beobachteten Fehlerraten. Die Einbeziehung von Bildern von 130 wild gefangenen 
Larven hatte keinen negativen Einfluss auf die erfolgreiche Erkennung von Individuen. Insofern ist die automatische Foto- 
identifizierung ein zuverlässiges Instrument für die Feldarbeit mit Larven des Europäischen Feuersalamanders. 


Introduction A wide range of estimation techniques exist, of which those 

for open populations, such as the Cormack-Jolly-Seber 

Reliable estimates of a species’ population size are of ut- method, are considered the most powerful (JOLLY 1965, 
most importance for conservation biology (WHITE 2000). SEBER 1965, PLEDGER et al. 2003). Open population ap- 
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proaches estimate the population size based on individu- 
al capture-recapture histories. Therefore, individual-based 
recognition of specimens in the wild is a requirement for 
solid population size estimation, making the choice of the 
marking method a crucial factor in planning a capture- 
mark-recapture (CMR) study. There is a wealth of tech- 
niques for marking vertebrates in the wild (e.g., SıLvy et al. 
2012), but only some of them are practicable for amphibians 
(FERNER 1975, HEYER et al. 1994). Most of these marking 
techniques are considered invasive and may negatively af- 
fect the survival rates of the marked individuals [e.g., May 
2004, LUNGHI & VEITH 2017, WAGNER et al. 2020a). This 
leaves fieldworkers with the necessity to carefully identify 
the most appropriate method by assessing both ethical re- 
sponsibility and scientific validity (PERRY et al. 2011, SILVY 
et al. 2012). Therefore, less invasive marking techniques that 
are at the same time cost-effective and reliable are high- 
ly desirable (BEAUSOLEIL et al. 2004). The most harmless 
way of individual identification of wild animals is not to 
mark them, but to use their natural individual ‘markings’ 
via photographical identification (capture-recapture only, 
CR). Such unique colour patterns are present in many am- 
phibian species (SCHLUPMANN & KUPFER 2009). 
Occasionally, common species become the focus of con- 
servation biology. This is increasingly caused by emerging 
infectious diseases (FISHER et al. 2012), as in the case of 
the European Fire Salamander (S. salamandra), which is 
currently threatened by the spread of the chytrid fungus 
Batrachochytrium salamandrivorans (Bsal). For more than 
a decade now, this pathogenic fungus, which is believed to 
be of Asian origin (MARTEL et al. 2014, O'HANLON et al. 
2018), has caused dramatic population declines and mass 
mortality events in The Netherlands, Belgium, Germany 
and Spain (MARTEL et al. 2013, SPITZEN-VAN DER SLUIJS 
et al. 2013, 2016). Therefore, detailed population studies on 
the European Fire Salamander are currently underway to 
monitor population developments in areas where the fun- 
gus is expected to expand its range (LÖTTERS et al. 2020). 
The terrestrial life stages of the European Fire Salaman- 
ders display a unique dorsal pattern with conspicuous, 
usually yellow, markings on a black background, which, 
in combination with their toxic skin components, makes 
them a candidate study object for aposematism in am- 
phibians (PREISSLER et al. 2019). Various studies have al- 
ready proven that this pattern can be used for individual 
identification of adult fire salamanders (BRADFIELD 2004, 
PLAIASU et al. 2005, SCHLÜPMANN & KUPFER 2009, MAT- 
THE et al. 2017, SPEYBROECK & STEENHOUDT 2017). Ironi- 
cally, despite this unique pattern, estimating the size of 
adult fire salamander populations is very time-consuming 
due to their secretive lifestyle (JUNG et al. 2000, HYDE & SI- 
MONS 2001, SCHMIDT et al. 2015). Monitoring of larval pop- 
ulations of European Fire Salamanders is therefore increas- 
ingly used as a proxy for the adult population when infor- 
mation on population trends is needed (SCHLUPMANN & 
KUPFER 2009, REINHARDT et al. 2018), especially in areas 
where population breakdowns due to Bsal have been ob- 
served or are expected (SANDVoss et al. 2020, WAGNER 


et al. 2017, 2020b,c). All these studies have used a removal 
sampling approach as suggested by SCHMIDT et al. (2015). 

WAGNER et al. (2020) tested the performance of the 
removal sampling approach of SCHMIDT et al. (2015) us- 
ing a CMR study based on Visible Implant Alpha tags 
(VIA), which are widely employed to mark amphibians 
(e.g., BUCKLEY et al. 1994, HEARD et al. 2008, OSBOURN 
et al. 2011), including larvae (CourrTots et al. 2013). They 
demonstrated that removal sampling strongly underesti- 
mated population size compared to CMR estimates. Apart 
from the fact that their recaptured marked larvae had sig- 
nificantly lower body indices compared to newly captured 
ones (so marking with VIA may be too invasive to be ap- 
plied to fire salamander larvae), some larvae had lost their 
tags or their tags were barely legible through the skin in 
which cases they successfully used photos of lateral tail 
patterns to re-identify individuals (WAGNER et al. 2020<). 

Larval fire salamanders possess a unique and character- 
istic, albeit faint black tail pattern on a more or less grey- 
ish background. However, no comprehensive study regard- 
ing the use of larval tail patterns for individual recogni- 
tions has been conducted as yet. Only in a few instances 
have tail images been matched visually (e.g., WAGNER et al. 
2020a), which is time-consuming and, according to EITAM 
& BLAUSTEIN (2002), only feasible when there are few (i.e., 
< 20) individuals. In addition, when the larvae approach 
metamorphosis, their tail patterns transform from a grey- 
ish ground colour to the species-specific black-yellow pat- 
tern (GIESENBERG 1991), which makes continued recogni- 
tion nearly impossible. 

Here we test if the lateral tail pattern of larval Europe- 
an Fire Salamanders can be used for computer-aided pho- 
tographic capture-recapture studies (PhCR; BOLGER et al. 
2012). If successful, this would pave the way for large-scale 
application in PhCR studies of larval populations. In par- 
ticular, we ask if, and to which extent, the ontogenetic al- 
teration of the tail pattern will over time reduce recogni- 
tion efficiency. We also assess how metamorphosis influ- 
ences recognition results, and up to which ontogenetic 
stage recognition is possible. Furthermore, we hypothesise 
that pre-contrasting of tail images may improve individual 
recognition. Finally, we test the assumption that increas- 
ing the size of the image library by adding images of wild- 
caught larvae will reduce individual recognition probabil- 
ity and increase the false recognition rate (FRR; BOLGER 
et al. 2012). 


Materials and methods 


In 2018, we captured 40 newborn larvae of the European 
Fire Salamander (S. s. terrestris) (under license from the 
‘Struktur- und Genehmigungsdirektion (SGD) Nord’ of the 
Federal State of Rhineland-Palatinate, Germany) in a first- 
order creek in the Hunsrück Mountains (Rhineland-Palati- 
nate, Germany; 50°11’00” N, 7°37'45” E). Nine larvae were 
captured on 22 April, and the other ones one week later (see 
Supplementary document S1). They were kept one by one, 
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each in 5 litres of tap water, to preclude the possibility of in- 
dividual mix-ups. The temperature was kept constant in a 
climate chamber at 15°C, with a 12/12-hour day/night cycle. 
The water was completely replaced on a weekly basis (aged 
tap water), and the larvae were fed ad libitum with sludge 
worms (Tubifex tubifex). Depending on their growth and 
ontogenetic development, the larvae remained in the exper- 
iment for up to ten weeks. From the beginning of the sec- 
ond week, each larva was photographed once a week with a 
specially adapted camera/tripod construction to document 
the change of their tail patterns. For this purpose, they were 
each placed in a small rectangular glass cuvette filled with 
water (Iwh = 8 x 3 x 4 cm), which was placed in a spe- 
cial bracket equipped with LEDs. Both sides of the animals 
were photographed parallel to the camera with a Nikon 
D7o with standard zoom lens (28-80 mm focal length, ap- 
erture 1: 3.3-5.6 G), but without flash. We consider left and 
right tail images as two more or less independent datasets 
of the same individuals, since there is no indication that left 
and right tail patterns develop equally, apart from the gen- 
eral background colour and dot density. To further test the 
performance of automated identification, we extended the 
dataset of the 40 test animals with images of another 130 
wild-caught larvae captured in the same creek in May 2018 
(photographed and released on site). 


Wild-ID (1.0.0) 


Wild-ID (1.0.0) is a Java-based matching software that rec- 
ognises individual patterns in images, compares them with 
patterns in other images and finds matches (BOLGER et al. 
2011). Unlike other pixel-based matching software, Wild- 
ID uses a feature-based ‘Scale Invariant Feature Transform’ 
(SIFT) algorithm. To some extent, this algorithm tolerates 
variations in image scale, rotation, illumination and cam- 
era viewpoint (LOWE 2004). The feature-based matching- 
algorithm proceeds across the following three steps: (1) the 
image is examined for characteristic, invariant features, so- 
called key points; (2) the properties of these features are 
summarised in so-called feature descriptors; (3) the de- 
scriptors of an image are compared with those of other im- 
ages during the matching process (NISCHWITZ et al. 2011, 
BOLGER et al. 2012). 

Based on the similarity of the descriptors, a score rang- 
ing from 1.0 (complete concordance) to 0.0 (no concord- 
ance) is determined during the matching process of two 
images (BENDIK et al. 2013). The use of Wild-ID is gener- 
ally divided into two parts: first, the steps outlined above 
are performed automatically by the software. In a second 
step, the user manually evaluates the 20 potential matches 
proposed by the software. 


Data processing 


To prepare the photos for Wild-ID, they were aligned hor- 
izontally alongside the shape of the tail of the larvae us- 
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ing Adobe Photoshop Elements 11. Then the part of the 
tail from the base of the hind leg to the tip of the tail was 
cropped at a ratio of 10 : 3.5. This left only the part of the 
image relevant for further investigation: the tail of the lar- 
va, which is characterised by an individual dot pattern (E1- 
TAM & BLAUSTEIN 2002). 


Contrast versions 


Wild ID is mainly used for species with conspicuous, 
high-contrast patterns. We therefore assumed that faint 
patterns, such as the lateral tail pattern of fire salamander 
larvae, might cause problems in the automatic recogni- 
tion of individuals and that enhancing the contrast of an 
image before analysing it with Wild ID would increase the 
identification success. We therefore compared three dif- 
ferent contrast versions (Fig. 1): (1) the original contrast 
version (ORG; no additional contrast), (2) a high-contrast 
colour version (COL), and (3) a binary greyscale version 
(BGS). 

Contrast version 2 (COL) was generated by maximis- 
ing contrast and brightness (+40%) and acuity (+50%) with 
Microsoft PowerPoint. The image was then saved and re- 
loaded, again maximising contrast and brightness (+40%). 
Contrast version 3 (BGS) was also produced by maximis- 
ing contrast and brightness (+40%) and recolouring (grey- 
scale: 25%). This step was effected with Microsoft Power- 
Point 2010 (EITAM & BLAUSTEIN 2002). The images were 
saved as JPEGs with 220 dpi in Microsoft PowerPoint (E1- 
TAM & BLAUSTEIN 2002). 


Editing precision - pre-test 


The precision with which a user aligns and crops the imag- 
es during processing may lead to a background variation of 
the matching coefficients, the possible impact of which we 
investigated in a pre-test. One image each of ten random- 
ly selected larvae (from the same randomly selected week) 
was aligned and cropped ten times each. They were then 
contrast-enhanced according to their contrast version. The 
first image of an individual was compared to the respective 
nine other images, resulting in three datasets (one per con- 
trast version) containing 90 images each. These were ana- 
lysed separately with Wild-ID to test how image processing 
itself affects matching results. 


Compilation of datasets 


Fire salamander larvae change their background colour 
over time; the density of melanophores increases so that 
the dot patterns become darker and darker during ontoge- 
ny (e.g., PEDERZOLI et al. 2003). To investigate whether this 
ontogenetic pattern change causes the recognition proba- 
bility to decrease over time, the first image of a larva (taken 
in week 2) was compared with the images of this individu- 
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al from the following weeks (‘change-over-time’ test). This 
was done separately for the left and right tail sides; again 
we tested the performance of the three different contrast 
versions. 

The ultimate goal of a pattern recognition software like 
Wild-ID is to detect and record the individual characteris- 
tics of captured animals. Therefore, a dataset usually con- 
tains all images taken during a field study. We simulated 
this scenario (‘reality test’) by compiling a data set con- 
taining all images collected from all laboratory individuals 
over time, as well as 130 images of the wild-caught larvae. 
This ‘reality test’ was performed separately for the right 
(431 photos) and left (429 photos; two images had been 
lost) tail sides. 


Statistics 


Automated species identification with Wild-ID is primarily 
aimed at identifying individuals. If the pattern of the speci- 
men in question is very clear and unique and the pattern 
has already been stored in the image database, easy recog- 
nition can be expected. Ideally, the first image that the soft- 
ware selects from the image database should already match, 
so that a false rejection of this image as not matching the 
photos from the database is unlikely. Therefore, the false 
rejection rate (BOLGER et al. 2012) quantifies the ease of 
finding a matching photo. In addition, the matching scores 
quantify the degree to which two images match. 

Matching scores were tested for normal distribution 
(Shapiro- Wilk tests). Since normal distribution was reject- 
ed in almost all instances (see Supplementary document 


left 


S2), we consistently applied non-parametric tests to guar- 
antee equal test powers. To compare matching scores be- 
tween treatments, we used the Friedman test for repeated 
measurings in cases where the three contrast versions of 
the same images were compared, including a post hoc test 
for paired comparisons with sequential Bonferroni cor- 
rection for multiple testing; in the few cases where images 
of only two contrast versions were available, we used the 
Wilcoxon test instead. For the comparison of independent, 
non-repeated matching scores we used the Kruskal-Wal- 
lis test, complemented by a sequential Bonferroni post hoc 
test for multiple comparisons. All tests were performed us- 
ing IBM SPSS*. 


False rejection rate (FRR) in the ‘reality test’ 


In addition to measuring the matching scores, we also 
wanted to know how easily Wild-ID correctly identifies in- 
dividuals already known. Therefore, we estimated the False 
Rejection Rate (FRR) to assess the overall identification 
performance of Wild-ID. The FRR represents the probabil- 
ity at which the matching software fails to match two im- 
ages of the same individual within a given number of best- 
matching photos divided by the number of identification 
attempts as a percentage and corresponds to the false nega- 
tive error (BOLGER et al. 2012). We estimated three differ- 
ent FFR versions: FRR, refers to cases where Wild-ID was 
unable to find a true match at first; FRR, refers to cases 
where no true matches were found among the ten highest 
ranking images; FRR, refers to cases where Wild-ID was 
unable to assign a correct image of the same individual to a 


right 


Figure 1. Comparison of contrast versions. Three contrast versions of the left and right tail side of the same individual: original version 
(ORG; upper), high-contrast colour version (COL; middle), binary greyscale version (BGS; lower). 
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Table 1. FRR-values of studies using Wild-ID and/or amphibians; ” in these studies, FFR was calculated in a slightly different man- 
ner: number of failures to match two images of the same individual within a given number of best-matching photos divided by the 
number of successful identification attempts (as a percentage); ’ FFR calculated by ourselves based on the data presented in the paper. 


Species FRR-value Software Image sublibrary Reference 

[%] 
FFR21 
European Fire Salamander (Salamandra salamandra) 0.19  Wild-ID original contrast version this study 
Jollyville Plateau Salamander (Eurycea tonkawae) 0.76  Wild-ID high quality photos BENDIK et al (2013) 
Jollyville Plateau Salamander (Eurycea tonkawae) 15.9 Wild-ID low quality photos BENDIK et al (2013) 
Masai Giraffe (Giraffa camelopardalis tippelskirchi) 0.67 Wild-ID BOLGER et al. (2011) 
Tuatara (Sphenodon punctatus) 2.4 Wild-ID right side SA ROCHA MELLO et al. 

(2019) 

Pacific Horned Frog (Ceratophrys stolzmanni) 5 Wild-ID BARDIER et al. (2020) ! 
FRRIO 
European Fire Salamander (Salamandra salamandra) 0.77 Wild-ID original contrast version this study 
Great Crested Newt (Triturus cristatus) 2 Amphldent DRECHSLER et al. (2015)! 
Marbled Salamander (Ambystoma opacum) 5 own software GAMBLE et al. (2008) 
Strinati’s Cave Salamander (Hydromantes strinatii) 4.2 Wild-ID cloaca/chest RENET et al. (2019) 
FRRI 
European Fire Salamander (Salamandra salamandra) 4.66 Wild-ID original contrast version this study 
Alpine Newt (Ichthyosaura alpestris) 0.58  Wild-ID males METTOURIS et al. (2016) ? 
Alpine Newt (Ichthyosaura alpestris) 1.66  Wild-ID females METTOURIS et al. (2016)? 
Smooth Newt (Lissotriton vulgaris) 759  Wild-ID males METTOURIS et al. (2016)? 
Smooth Newt (Lissotriton vulgaris) 18.6 Wild-ID females METTOURIS et al. (2016) 
Marbled Salamander (Ambystoma opacum) 30 own software GAMBLE et al. (2008) 
Strinati’s Cave Salamander (Hydromantes strinatii) 8.2 Wild-ID cloaca RENET et al. (2019) 
Strinati’s Cave Salamander (Hydromantes strinatii) 9.3 Wild-ID chest RENET et al. (2019) 


test image within 20 potential matches (BOLGER et al. 2012, 
DRECHSLER et al. 2015). FFR values were calculated from 
the data set we had compiled for the reality test (see above). 


Results 
Pre-test 


Our pre-test produced the best results using the ORG ver- 
sion (Fig. 2), with an average score of 0.974 across all indi- 
viduals (the maximum value of 1.0 was achieved in 81% of 
all matches), followed by the COL version (0.909; 71% with 
a score of 1.0), and the BGS version (0.872; 70% with a score 
of 1.0). When analysed together, the matching scores dif- 
fered significantly between the three versions (Friedman 
test); however, none of the post hoc pairwise comparisons 
remained significant after Bonferroni correction, with a ten- 
dency of ORG producing higher scores than BGS (p=0.063) 
(Supplementary document S2). 


‘Change-over-time test’ 


Wild-ID performed best with ORG over time and worst 
with the BGS. All contrast versions showed decreasing 
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scores with the more time had elapsed between two im- 
ages (Fig. 3). The Kruskal-Wallis test with post hoc Bon- 
ferroni correction for the comparisons over weeks almost 
always revealed highly significant differences for ORG (p 
< 0.001); interestingly, the decrease in matching scores be- 
tween the respective first pair of comparison (e.g., week 2 
with 3 versus week 2 with 4) was not significant (Supple- 
mentary document S2). The other two contrast versions 
exhibited less pronounced differences over time (Supple- 
mentary document SI 2). Friedman tests of post hoc pair- 
wise comparisons of contrast versions within weekly com- 
parisons always showed higher scores for ORG compared 
to COL and BGS, but only a slight significance for COL 
and BGS in week 2 with week 3 comparisons (Supplemen- 
tary document S2). 


Comparison of body sides 


Matching scores for the left and right tail side of an indi- 
vidual can vary (e.g., ORG for week 2 with 3 of individual 2: 
0.605 and 0.129 for the left and right side, respectively). Their 
median scores across all individuals and weeks are similar 
(Fig. 4), with no significant difference in all three contrast 
versions (all p > 0.05; see Supplementary document SI2). 
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Figure 2. Matching scores of the pre-test (ORG: original version, COL: high-contrast colour version, BGS: binary greyscale version). 
Scores of ten independently edited images from ten randomly selected individuals (pre-test) are given; average matching scores are 


indicated by crosses; median (-) and mean (+) matching scores are given. 
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Figure 3. Box-whisker plots with median (-) and mean (+) matching scores of weekly comparisons (ORG: original version, COL: high- 


contrast colour version, BGS: binary greyscale version). Scores for left and right sides are averaged. 
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‘Reality test’ with wild-caught larvae 


Only one image of the right tail side could not be matched. 
For all images of the left tail side, a matching image was 
identified within the 20 possible matches offered by Wild- 
ID. For all other images of the 40 individuals, Wild-ID was 
able to find correct matches, even for those images from 
later weeks that could not be successfully matched in the 


‘change over time’ test. This resulted in an average success 
rate of 99.81% for both sides, with FRR at 4.66%, FRR, , at 
0.77%, and FRR, at 0.20% (Fig. 5). The lowest score of an 
image that matched correctly within the 20 images sug- 
gested by Wild-ID was 0.00002, while the highest score for 
an image that did not match within the first 20 images was 
0.5. Nevertheless, there was almost no overlap in scores be- 
tween matching and non-matching images (Fig. 6). 
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Figure 4. Box-whisker plots with median (-) and mean (+) matching scores for the left and right tail sides (ORG: original version, 
COL: high-contrast colour version, BGS: binary greyscale version). Scores are given for weekly comparisons. 
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Figure 5. Cumulative amounts of correct matches. Correct matches are given with increasing matching rank from amongst 20 images 
of left versus right tail side as suggested by Wild-ID as potential correct matches; the boxplots quantify the matching scores across 


matching ranks. 
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Figure 6. Comparison of correct and false matches. Matching 
scores of the images suggested correctly or incorrectly by Wild- 
ID within the first 20 images. 


Discussion 


The use of natural markings for the recognition of individ- 
uals within wild animal populations in PhCR has become 
an important tool for a variety of biological studies, such as 
individual behaviour, population demography, and popu- 
lation dynamics (BOLGER et al. 2012). Its non-invasiveness 
(SıLvy et al. 2012) and its usually low costs (CRUICKSHANK 
& SCHMIDT 2017) make it superior to invasive marking 
techniques, and automation through specialized imag- 
ing software significantly reduces the time for re-identify- 
ing individuals from a large image library (BOLGER et al. 
2012). While it seems obvious that photographic identifica- 
tion can be associated with low error rates in species that 
display conspicuous and high-contrast patterns (among 
amphibians, e.g., the Bellied Toads (genus Bombina; e.g., 
Vörös et al. 2007, GOLLMANN & GOLLMANN 2011, CRUICK- 
SHANK & SCHMIDT 2017) or the European Fire Salaman- 
der (e.g., CARAFA & BIONDI 2004, GOEDBLOED et al. 2017), 
animals with faint patterns may pose problems in PhCR. 
However, our test of Wild-ID’s performance on tail photos 
of European Fire Salamander larvae clearly demonstrates 
that despite the sometimes very low matching scores, in- 
dividual identification is highly reliable and can be used in 
studies on wild populations. 

Our pre-test shows that the processing of images may 
already lead to a - albeit small - variation of matching 
scores. However, the average of 0.974 for the original (un- 
altered) contrast version is still close to the highest pos- 
sible score of 1.0. Interestingly, the original image version 
already achieved significantly better results than the two 
processed contrast versions in the pre-test. However, this 
difference was not significant, so that our hypothesis that 
prior contrast-enhancing improves matching scores can 
be rejected. This is maybe due to the fact that Wild-ID it- 
self performs a greyscale transformation prior to the actual 
scoring. The fact that the binary greyscale (BGS) version 
performed worst is probably due to the fact that too much 


information of the image is lost during the transformation. 
Other studies also recommend the use of multicolour rath- 
er than binary images (MATTHE et al. 2017). In contrast, 
the slight reduction in the matching scores of the original 
version caused by the alignment and cropping of images 
can hardly be avoided, since cropping the images is nec- 
essary and strongly recommended when using Wild-ID 
(BOLGER et al. 2011). It minimizes possible sources of error, 
namely the background as well as colour- and pattern-in- 
variant image parts (BOLGER et al. 2011, ELGUE et al. 2014). 
Therefore, alignment and cropping cannot be dispensed 
with, and given the overall low matching scores of pictures 
taken of the same individual at different times, this slight 
reduction due to processing seems negligible. 

The decreasing matching scores over time confirm our 
assumption that the shorter the time between two pho- 
tos, the more readily the recognition of individuals will be 
achieved. We expected that time would affect pattern rec- 
ognition, since it is known that fire salamander larvae con- 
tinually change their background colour and dot pattern 
(e.g., PEDERZOLI et al. 2003), a fact that a priori contradicts 
one of the basic conditions of PhCRs, namely that an in- 
dividual’s pattern is stable over the duration of the study 
period (BOLGER et al. 2012). This phenomenon has also 
been described for other amphibian species (ARNTZEN & 
TEUNIS 1993, GOLLMANN & GOLLMANN 2011, DRECHSLER 
et al. 2015, BARDIER et al. 2017). Other studies have found 
that changes in body shape and weight, but also growth or 
other reasons are responsible for the negative correlation 
between time and recognition probability (BENDIK et al. 
2013, METTOURIS et al. 2016). 

In particular, the trend towards significantly worse re- 
sults from week 7 onwards suggests that the decreasing rec- 
ognition rate is due to the metamorphosis of the larvae (E1- 
TAM & BLAUSTEIN 2002). During this phase, the black dots, 
which represent dense groups of epidermal melanophores, 
become larger and finally merge into the black background 
colour of the larvae. Depending on the water temperature, 
metamorphosis starts after 40 to 120 days (SEIDEL & GER- 
HARD 2016). Our laboratory larvae entered metamorpho- 
sis at an age of about seven weeks (= 50 days). This still 
does not mean that our method is at a disadvantage com- 
pared to invasive marking techniques such as Alpha or VIE 
tags. WAGNER et al. (2020a) pointed out that the intensified 
coloration of the skin of salamander larvae shortly before 
metamorphosis also makes Alpha tags less visible. Further- 
more, such tags may disappear or migrate to different parts 
of the body during metamorphosis (GARNT 2008, BRAN- 
NELLY et al. 2013, COURTOIS et al. 2013). The adverse ef- 
fect of the increasing obscuration of the tail pattern toward 
metamorphosis can be compensated, at least partially, by 
using images of both sides of the tail. We were able to show 
that they did not differ significantly in matching probabil- 
ity and thus can be considered an independent second set 
of characters for identification. 

The significantly different scores of correct and false 
matches suggest that the scores in themselves can serve as 
an informative indicator for the individual identification of 
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larval fire salamanders. A score of > 0.250004 may, at least 
under the specific conditions of our test situation, serve as 
a threshold to decide whether two images match. Other 
studies may find different thresholds; BENDIK et al. (2013) 
were able to set a matching threshold of 0.1 when they pre- 
tested the dorsal melanophore pattern of the heads of a 
small number of Jollyville Plateau Salamanders (Eurycea 
tonkawa) with VIE-Tags and Wild-ID. We, therefore, rec- 
ommend that should be determined for any given study 
species by prior re-identifying a small group of known in- 
dividuals. This may considerably improve the evaluation of 
the entire picture library: matches that exceed this thresh- 
old can almost certainly be considered correct and only 
matches with lower scores must be checked by hand. In- 
terestingly, in our study, Wild-ID incorrectly matched the 
photos of two apparently very different larvae with a score 
of ca. 0.5, which is very much > 0.250004. But relying solely 
on matching scores would have resulted in an undetect- 
ed false-positive match. In a real study situation, the user 
would have immediately recognised the great difference 
between the images if it had been suggested by Wild-ID as 
one out of 20 potential matches. 

Even under a more realistic study situation (all pictures 
of all laboratory larvae were merged into a single picture 
library together with the images of the 130 wild-caught 
larvae), the success rate of identification was 99.81%. Our 
hypothesis that ‘the larger the dataset, the lower the rec- 
ognition rate’ must therefore be rejected, at least for our 
library size of ca. 430 images. This is consistent with the 
results of other studies where Wild-ID was used for indi- 
vidual recognition (BOLGER et al. 2012, METTOURIS et al. 
2016) and leads us to conclude that Wild-ID is a suitable 
tool for identifying larvae of the European Fire Salaman- 
der (S. salamandra) even from large picture libraries. This 
is based on the very low FRR rates we found, although the 
matching scores themselves were often very low. However, 
as populations of salamander larvae can be very large (e.g., 
WAGNER et al. 2020; see also VEITH et al. 2022 for a re- 
view), successful use of the lateral tail pattern of larvae for 
individual recognition still needs to be tested under field 
conditions. 

To better relate our results, we have compared our FFR 
values with those of other studies (Table 1) using Wild- 
ID or working with amphibians. It appears that our FRR, 
and FRR, values were the lowest compared to other stud- 
ies. Only our FRR value of 4.66% was in the range of cor- 
responding values in other studies. We therefore recom- 
mend considering not only Rank 1 matches but at least 
also Ranks 2-10. MATTHE et al. (2017) also encountered an 
improvement in the performance of about 26% when con- 
sidering the top 10-ranked matches instead of single Rank 
1 match. Nevertheless, incorrect matching can still occur. 
PhCR studies with such low error rates, however, produce 
altogether reliable results, e.g., for population estimation 
studies (MORRISON et al. 2011). 

There are several reasons why PhCR performed better 
in our study compared to other studies. Firstly, the SIFT- 
algorithm used by Wild-ID is considered an advantage 
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over a pixel-based algorithm, such as used by Amphldent, 
because it is more tolerant of variations in rotation, angle, 
scale and illumination (LOwE 2004, MATTHE et al. 2017). 
Image quality is also very important (BENDIK et al. 2013). 
The same is true for the time between recaptures, which is 
known to negatively affect the recognition rate (BENDIK et 
al. 2013, METTOURIS et al. 2016) and which was compara- 
tively short in our study. In addition, changes in the pos- 
ture, shape and size of the animals (GAMBLE et al. 2008) as 
well as glare and flash may contribute to recognition un- 
certainty (MATTHE et al. 2017). The latter was avoided in 
our study by using a special LED illumination system, by 
wiping off water droplets from the photo cuvette and by us- 
ing a rectangular instead of a round cuvette. Another very 
important factor influencing the recognition success is the 
type and variation of the species-specific pattern itself. 
ELGUE et al. (2014) demonstrated that the highly variable 
ventral pattern of individuals of Melanophryniscus monte- 
vidensis produced better matching results compared to the 
less variable M. admirabilis and M. cambaraensis. A high- 
er number of dots (as in Ichthyosaura alpestris compared 
to Lissotriton vulgaris) or larger dots (as in male L. vulga- 
ris compared to female L. vulgaris) may also lead to better 
matching results (METTOURIS et al. 2016). 

In our study setup, recognition is even possible across 
later weeks, when metamorphosis has already started, al- 
beit at low rates. This will increase if not only week II, but 
also later photos are compared with pictures taken shortly 
before metamorphosis, as is indicated by our reality test 
where all images of each larva were successfully matched. 
Nevertheless, the time intervals between two capture 
events should be kept as short as possible, in particular, if 
the animals patterns change over time (e.g., in metamor- 
phosing larvae; BENDIK et al. 2013). 

In conclusion, a computer-assisted photographic iden- 
tification of European Fire Salamander larvae (S. salaman- 
dra) is not only possible with Wild-ID (and probably also 
with other image-matching software), but also very relia- 
ble. It has many advantages over other marking techniques, 
such as fast identification both in terms of image produc- 
tion and matching. Apart from the necessary hardware, it is 
inexpensive and most importantly less invasive than other 
techniques (Caorst et al. 2012, SUKALO et al. 2013, WAGNER 
et al. 2020a). Nevertheless, especially with small animals, 
taking pictures can already require a considerable amount 
of handling, either by photographing them in-hand or by 
using a manipulation or fixation tool. This reintroduces the 
potential stress that the non-invasive technique is intended 
to avoid (TREILIBS et al. 2016). In our approach, the use of 
specially adapted photographic equipment, with a water- 
filled cuvette mounted on a tripod, reduces stress and even 
guarantees that the photos depict the animals in a more or 
less standardised position. In a slightly modified form, this 
tripod setup has already been successfully used under field 
conditions by WAGNER et al. (2020<). 

Alignment and trimming of photos are highly recom- 
mended, but editing the contrast of photos prior to using 
them for identification in a photo library, as we did, is not 
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only unnecessary - it will even reduce matching scores. 
If the time interval between two recaptures is short, e.g., 
about a week, the match is not negatively affected by ongo- 
ing metamorphosis, and the larvae can be identified until 
they leave the water. If feasible, a pre-test will help to de- 
termine study-specific matching thresholds that can speed 
up the identification process when working with large pho- 
to libraries. Automated PhCR will therefore substantially 
improve the efficiency of population studies of larval fire 
salamanders and thus can significantly contribute to the 
monitoring of endangered salamander populations in ar- 
eas where the salamander plague caused by Bsal occurs or 
is expected to occur. 
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